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A  technique  is  described  for  characterizing  arbitrary,  simply  connected 
plane  figures  by  a  set  of  numbers  derived  jfrom  the  shape  of  their  outlines. 

The  basis  of  the  description  is  taken  to  tie  the  slope  of  the  contour  as  a 
function  of  distance  along  the  contour.  This  function  is  expanded  in  Fourier  ; 
series,  and  the  series  coefficients  are  taken  as  a  set  of  descriptors. 

Topics  discussed  are  the  effects  of  distortion  and  approximation  of  contours, 
reconstruction  of  figures  from  their  descriptors,  and  application  of  the 
descriptors  to  pattern  recognition.  Alternates  to  the  Fourier  series  ex¬ 
pansion  of  the  slope  function  are  also  considered. 
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SOME  ASPECTS  OF  THE  TANGENT- ANGLE  WS. 
ARC-LENGTH  REPRESENTATION  OF  CONTOURS 


I.  Introduction  y 

The  significant  information.,  in  a  black-and-white  halfton^  picture  field 
can  be  represented  economically  by  a  set  of  contours  drawn  to  ^separate  areas 
nf  thp  picture  that  are  everywhere  darker  than  given  shades  of  gray  from 
a£eas  that  £e  everywhere  lighter . ^ >2 For  purposes  of  automatic  analysis 
of  pictures,  it  is  desirable  to  reduce  these  contours  to  uniform  sets  of 
descriptors,  which  may  then  ce  subjected  ta  various  general  classification 
techniques.  This  report  describes  the  tangent-angle  vs.  arc- length  rppre- 
sentation-as-TTBaHs  for  generating  descriptors  for  contours,  discusses 
certain  aspects  of  the  completeness  of  the  resulting  description  and  its 
sensitivity  to  noise,  and  presents  results  .indicative  of  its  usefulness  for 

classifying  shapes.  \ 

The  tangent-angle  vs.  arc-length  representation  of  contours  jj^s  been 

described  in  previous  reports;  >5  therefore,  only  a  hf,  tt6p : 

here.  Assume  an  arbitrary  closed  contour  (such  as  the  outline  of  the  letter 
P  in  Fig.  la).  Starting  at  an  arbitrary  point  and  going  clockwise,  plot 
Le  ana!;  between  the  directed  tangent  to  the  contour  and  a  reference  Mne 
aa  a  function  of  the  distance  from-  the  starting-  pozxrtf  measured  along-  tite  ■ 
contour.  The  resulting  function  (Fig.  lb)  is  a  complete  description  of  the 
contour  in  the  form  of  a  single- valued  function  of  one  variable.  It  should 
be^oted  that  logic  circuits  could  readily  be  built  to  construct  such  a 
description  from  the  Information  stored  in  the  magnetic  core  matrix  o 

ATRID  I  model.6  . 
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by  the  combination  of  the  phase  and  the  moan  value.  With  these 
possible  to  reconstruct  the  original  contour  exactly.  I  V  » 

the  contour  is  of  interest,  it  is  convenient  to  normalize  the  leng 

period  to  2H  radian. 
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S^n^^e  descriptor 
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A  contour  (a)  and  associated  <t>U)  curves.  Curve  (b),  result 
S T  starting  to  trace  contour  at  cross;  curve  (c)  result  of 
starting  at  dot.  Curve  (d)  is  periodic  function  obtained 
by  adding  ramp  function  to  measured  angle  b’U). 
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vectors  of  this  type  in  general  do  allow  meaningful  discrimination  between  : 
contours,  they  are  not  the  most  desirable  available,  because  in  the  absence 
of  phase  information  they  do  not  allow  even  approximate  reconstruction  of 
the  original  contours.  Phase  information , may  be  retained  in  either  of  two 

ways.  In  one,  the  Fourier  series  repire&ehting  the  contour  is  expressed  in 
polar  form.  Then  the  first  n  harmonibsof  the  isbries  lead  to  a  descriptor 
vector  of  2n  components,  of  which  onei-half  are  (measures  of  amplitude  and 
the  other  half  are  measures  of  phase.  In  the  other,  the  Fourier  series  is 
expressed  in  cosine- sine  form,  with  alternate  components  of  the  descriptor 
vector  representing  magnitudes  of  cosine  and  sine  terms.  This1  latter 
form  is  the  one  that  was  used  in  the  work  to  be  described.  Descriptor 
vectors  of  this  form  contain  the  information  necessary  to  reconstruct  the 
contours  to  an  accuracy  limited  only  by  the  number  of  harmonics  considered 
(see  Section  III).  .  \ 


II.  Approximation  of  Contours  and 

Noise  Considerations  ' 

It  is  important  to  note  that  conceptually  a  tangent-angle  vs. ,  arc- 
length  formulation -can  be  found  to  represent  exactly  any  reasonable  contour. 

In  practri^£irapplications ,  it  is  equally  important  that  adequate  approxi¬ 
mations  to  the  exact  function  can  be  obtaincdv ee -ily  and  quickly.  This 
section  discusses  techniques  for  constructing  \approximate  4>(i)  curves  and  for 
computing  their  Fourier  series  by  use  of  a  digital  computer.  The  same 
concepts  used  in  programming  the  digital  computer  to  construct  the  contours 
could  be  applied  in  designing  logic  circuits  to  extract  contour  descriptions 
from  the  information  stored  in  the  core  matrix  of  the  ATRID  model. 

In  principle  the  approximation  used  consists  of  substituting  a  contour 
made  up  of  straight-line  segments  for  the  exact  contour.  This  has  the  effect; 
of  making  the  tangent  angle  $  a  piecewise  constant  function  of  the  arc  length' 
£  (before  the  ramp  function  is  subtracted).  ’LT  L  T:i 

Because  the  approximate  boundary  consists  of  straight-line  segments,  -nv,k-i 
the  Fourier  expansion  of  its  <t>(i)  function  can  be  calculated  very  simply. 

The  tangent  Mangla  is  a  piecewise  constant  function  of  the  circumference.  -  f- 
.This  may ITe  expressed  by  •  - 


l 

♦  ’(i)  «  for  £i_1  <  £  Ji, 


with  denoting  the  angle  of  the  ith  line  segment.  Here  £0  -  0  and  is 
the  umulative  length  of  the  first  i  straight- line  segments.  Then  if  the 
contour  is  approximated  by  m  segments,  £m  equals  the  total  length  of  the 
contour  L.  I " 

The  periodic  function  4>(i)  then  is 
(2)  *(£)  =  *'(f)  +  2JI  rjr  . 
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Proceeding  with  the  Fourier  series  expansion  cf  4>(i), 


one  may  write 


(3) 


2  rL 

■^n  "  L  J  o  cos  (2ir  n  i)M  > 

or,  in  light  of  Eq.  (2), 

2  n  L  p  Ij 

(4)  An  =  I  /  4>'(i)  cos  (2H  n  7)di  +  f  f  211-  — 

°  1*  W  q  L 


cos  (2JI  n  £)d Jt  . 


f*™1  IIlFH  the™ n^f  of 'the  Vn?egrS  “lub 

intervals  L£±mxt£i ]•  One  may  then  rewrite  Eq|.  (4)  as 
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=  C°S  ^  n &¥  m4.  CDE  to -  i 
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(6) 


m 

An  “  miZ 

i-1 


>i [sin  • 


summation  of  Eq.  (6;  as  follows: 

m-1  . 


(7) 


An  -  m 


one  may  regroup  the  terms  of  the 


m-1 


i-1 


to  obtain 


m-1 


i=l 


An  y>>i“W  sin^  ^  • 


(8) 


Similarly, 


(9)  ®n  -  l/0L‘(i)  sin  (211  n  ^  di 


leading  to 

(10) 

where 

(11) 


v 

-I Z i 4  *1 sln  (2n " l) «  ♦  zf^11  i sln  (2n n V "  ’ 

i*l  1-1 
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Bn-^iZ  cos  t  +  ?y 

i®l  I 


pm+i 


a  *x  +  2n. 
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Thus  it  is  seen  that  the  Fourier  coefficients  can  be  obtained 
evaluating  sines  and  cosines  at  the  m  points  around\  the  contour  at  which  it 
changes  slope  and  by  summing  these  values,  multiplied  by  the  net  change  in 
slope  at  each  point. 


Preparatory  to  extracting  the  approximate  contour,  a  square  grid  is 
superimposed  on  the  image.  It  is  convenient  to  think  of  the  contour  not  as 
a  line  but  rather  as  the  boundary  between  two  regions'of  the  field,  which  we 
may  call  "black"  and  "white."  All  squares  which  contain  more  ihan  some 
specified  fraction  of  "black"  areS,  are  assumed  to  be  entirely  "^lack." 

Boundary  squares  are  then  defined  as  those  "black"  squares  having  at  least 
one  side  in  common  with  a  "white" -square.  Two  boundary  squares  are  considered 
adjacent  if  they  have  either  a  common  edge  or  a  common  corner.  The  boundary 
then  is  approximated  by  the  straight-line  segments  connecting  the  midpoints 
of  successive  adjacent  boundary  squares.  i 

i 

SuccessTveT~steps  in  the  process  of  approximating  a  contour  are  illus¬ 
trated  in  Fig.  2.  It  should  be  noted  that  certain  squares  may  not  be 
boundary  squares  according  to  the  definition  given,  even  though  the  actual 
boundary  (but  not, of  course,  the  approximate  one)  passes  through  them. 


Some  indication  of  how  the  coarseness  of  the  grid  affects  the  quality 
of  the  "pproximation  is  given  by  Fig.  3>  showing  two  approximations  to  the 
same  contour  obtained  by  using  grid  squares  of  the  size  used  in  Fig.  2  for 
one  and  grid  squares  of  half  that  size  for  the  other.  A  more  definitive 
criterion  for  choosing  the  grid  size  to  be  used  may  be  derived  from  consid¬ 
eration  of  Fig.  4,  which  illustrates  that  if  the  grid  is  too  coarse,  it 
may  depend  upon  the  exact  location  of  a  figure  relative  to  the  grid  whether 
significant  features  of  the  contour  appear  in  the  approximation  or  not. 
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Fig.  2  -  Successive  steps  in  approximating  a  smooth  contour  by 
straight-line  segments:  (a)  A  grid  is  laid  over  the 
figure,  (b)  All  grid  squares  partially  black  originally 
are  made  completely [black;  approximate  boundary  is  con- ■ 
structed  by  connecting  midpoints  of  black  boundary  j 
squares. 


It,  Variations  in  detail  of  approximate!  contour  brought  about  by  shifting 
grid.  Indentation  in  original  contiour  is  reproduced  in  approximation 
with  grid  placed  as  in  (a),  missed  with  grid  placed  as  in  (b). 
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It  is  clearly  desirable  that  fixing  the  grid  size  should  determine  the 
mimimum  size  of  detail  to  appear  in  the  approximate  contour.  Variations  in 
the  detail  because  of  random  placing  of  the  original  contour  relative  to  the 
grid,  as  between.  Fig.  4a  and  Fig.  4b,  must  be  considered  a  type  of  noise. 

*  .Such,  noise  'Has  two  effects  upon  the  4 >(i)  function.  .  First,  the  magnitude  of 
the  tangent  angle  4>  is  changed  in  the  vicinity  of  tihe  distortion  of  the 
contour.  Second,  the  length  of  the  contour  is  charged.  The  second  effect 
is  the  more  important,  for  if  distortions  of  one  part  of  a  contour  signifi¬ 
cantly  change  its  length  while  the  rest  of  the  contobr  is  unchanged,  the 
overall  form  of  the  $(,0)  function  may  be  greatly  altered  (Figs.  5  and  6)j, 
with  an  attendant  effect  upon  the  Fourier  series  coefficients. 


The  fact  that  noise  of  the  type  described  idoes  change  both  the  "dependent" 
and  the  "independent"  variable  in  the  <f(i)  representation  (which,  incidentally^ 
means  that  the  noise  is  not  additive)  changes  the  very  nature  of  the  filtering 
problem.  Superimposed  noise  caused  by  irrelevant  fine  detail  might  be  re¬ 
moved  by  a  low-pass  filter.  In  fact,  since  only  a  limited  number  of  harmonics v; 
are  considered  in  the  Fourier  series  expansion,  the  process  of  extracting 
descriptors  would  by  itself  remove  the  high-frequency  noise  components.  Under 
the  actual  circumstances,  no  linear  filtering  of  the  4>(jC)  function  is  useful 

_ J _ _ _ _  J  1 _  1  •  1  II  n  .  -  .  .  ... 


for  reducing  the  noise  caused  by  distortions, 
must  be  carried  out  on  the  act  al  contour. 


Instead,  the  smoothing  process 


Smoothing  and  approximation  of  a  given  contour  are,  in  principle, 
different  operations.  In  practice,  they  are  re|adily  Combined.  The  purpose 
«  of  smoothing  is  to  replace  by  smooth  arcs  those  sections  of  a  contour  con¬ 
taining^  irrelevant  fine  detail.  The  purpose  of  approximation  is  to  reduce 
curved  contours  to  straight-line  segments  bound  to  a  grid.  The  combined 
smoothing- approximating  operation  ideally  Should  transform  a  given  contour 
containing  detail  finer  than  that  desired  jo  &  Istraight-line  approximation, 
with  all  the  detail  finer  than  some  specified  sjize  eliminated  and  all  coarser 
features  indicated  rather  faithfully.  I  /  |  j  «.  ! 


It  must  be  realized  that  the  ideal  described  above  cannot  be  achieved. 
Any  process  incorporating  a  quantizing  step  is  subject  to  quantizing  noise. 
Here  it  will  manifest  itself  in  the  unreliability  of  the  designation  of  a 
square  on  or  near  the  actual  contour  as  "white"  or  "black,"  Thus  the  "size" 
of  a  given  piece  of  detail  cannot  be  fixed  exactly,  and  it  is  not  possible 
to  distinguish  in  all  cases  between  "fine  irrelevant  Retail"  to  be  smoothed 
out  and  "significant  features"  to  bo  retained.  The  criterion  for  smoothing 
must  therefore  be  based,  here  as  in  other  cases,  upon  the  likelihood  that 
certain  classes  of  features  are  in  error  and  upon  the  seriousnesk  of  the 
influence  such  error  might  have  upon  the  descriptors. 


No  quantitative  description  of  noise  effects  is  available.  Qualitatively, 
it-may  b'"'  said  that  the  most  probable  noise  consists  of  irregular  serration 
of  the  boundaries  between  areas  and  other  minor  discontinuities  in  the  i 
contours.  Jlhe-^ffetrtr  of  such  noise  may  be  to  obscure  the  gross  featured  of 
the  contour,  as  illustrated  in  Figs.  5  and  6. 


The  smoothing- approximation  technique  developed  to  minimize  the  effects 
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-  Effects  of  noise  introduced  by  serrations  of  contours:  (a)  Contour 
(b,c)  two  possible  approximations.  Tangent  angle  as  function  of 
distance  along  contour  for  approximation  (b)  is  shown  by  (d)  and 


for  approximation  (c)  by  (e). 
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(c) 


v  Vd>  \ 

7  -  Successive  steps  in  tracing  approximate  outline  of  a  figure  stored 
in  computer  memory  as  a  zero-one  matrix.  E's  denote \ elements  ex¬ 
amined  and  found  zero;4B^'s  denote  successive  boundary  squares; 
small  superscripts  show  order  in  which  tests  are  performed. 
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of  such  noise  will  now  be  described  in  terms  of  its  digital-computer  im¬ 
plementation.  Subsequently,  some  indication  will  be  given  of  how  the  same 
results  could  be  obtained  working  with  the  ATRID  device. 

| 

A  given  figure  is  stored  in  computer  memory  as  a  zero-one  matrix,  each 
"zero"  element  representing  a  "white"  square  and  each  "one"  element  repre¬ 
senting  a  "black"  square.  The  first  boundary  square  is  found  by  scanning  a 
given  line  of  the  matrix  until  a  "one"  element  is  found.  If  no  smoothing 
is  to  be  performed,  the  elements  corresponding  to  adjacent  squares  are 
examined  in  clockwise  order  until  another  "one"  is  found.  This  corresponds 
to  the  next  boundary  square,  The  squares  adjacent  to  it  are  again  examined 
in  clockwise  order,  and  so  forth.  Figure  7  shows  successive  steps  of  the 
search,  ■  |:L  i  i 

Smoothing  of  a  contour  is  achieved  by  changing  certain  initially  "white" 
squares  to  "black"  if  such  change  could  effect  a  straightening  of  a  piece 
of  a  contour  by  filling  in  breaks  or  gaps.  The  criterion  for  the  change 
is  established  as  follows:  Assume  a  particular  boundary  square  is  given. 
Examine  in  clockwise  order  the  squares  beyond  the  immediately  adjacent  ones. 
The  first  one  to  be  found  black  will  be  a  boundary  Square  also.  It  is 
desired  that  the  boundary  between  these  two  points  rae  straight.  Accordingly, 
the  intervening  squares  are  made  black  if  necessary ,\so  as  to  allow  them  to 
be  boundary  squares.  The  approximation  theh  proceedsX  as  before;  the  squares 
jadjacent  to  the  last  square  of  the  approximate  contour  are  examined  in  clock¬ 
wise  order,  and  the  boundary  is  extended  to  the  midpoint  of  the\first  one 
found  to  be  "black." 

Figure  8  shows  successive  steps  of  the  combined  smoothing- approximating 
process.  The  process  illustrated  will  smooth  out  detail  of  size  on  the 
order  of  one  or  two  squares.  This  appears  to  correspond  to  the  most 
reasonable— combination  of  choice  of  grid  size  and  definition  of  extraneous 
detail.  It  is  the  only  smoothing  scheme  actually  programmed.  It  is  clear, 
however,  that  the  concept  could  be  extended  to  e^^amining  squares  further 
away  and  changing  to  black  more  than  one  square  on  the  basis  of  one  examina¬ 
tion. 

Figure  8a  shows  the  scanning  process,  which  is  the  same  as  that  in  Fig. 
7a.  Figs.  8b,  d,  f,  and  h  show  the  order  of  searching  ahead  for  a  boundary 
square.  In  Figs.  8c,  e,  g,  and  i  the  X  indicates  the  square  thus  found, 
and  the  starred  "one"  is  that  element  which  must  as  a  consequence  correspond 
to  a  "black"  square  if  the  contour  is  to  be  smooth.  It  may  be  noted  that 
in  all  but  Fig.  8g,  the  starred  element  was  "one"  initially.  It  may  also 
be  noted  that  the  starred  element  in  Fig.  8e  does  not  correspond  to  a  boundary 
square,  because  the  clockwise  search  about  .detects  another  'one  element 
first. 

If  the'fCTKTtTdevIce  were  to  be  used  for  tracing  contours,  a  somewhat 
different  procedure  would  apply.  In  the  ATRID  core  matrix,  the  cores  con¬ 
taining  ones  are  those  corresponding  to  grid  squarejs  in  which  there  occurs 
the  transition  from  one  level  to  another,  i.e.,  frota  "black"  to  "white  or 
"white"  to  "black."  These  are  boundary  squares.  Formally  this  definition 
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Fig.  8  -  Successive  steps  in  process 


that  combines  smoothing  and  approxi¬ 
mation  of  the  outline  of  a  figurk  stored  in  computer  memory  as  a 
zero-one  matrix.  ■  V 
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of  "boundary  square"  is  different  from  jthat  previously  given,  but  in  1 
practice  there  is  no  real  difference.  For  the  operation  of  curve  tracing, 
a  core  known  to  be  on  the  boundary  could  be  interrogated  while  successively- 
paired  with  the  cores  adjacent  to  it.  If  smoothing  were  to  be  performed 
also,  the  cores  could  be  interrogated  in  sets  of  three,  with  pulses  large 
enough  to  give  a  response  if  any  two  of  the  cores  were  set. 


Ill,  Reconstruction  of  Figures  from  ■  ;■  j 

Their  Descriptors 

The  normalized  tangent-angle  vs.  arc-length  curve  is  a  complete  ■  ; 
description  of  the  shape  of  a  figure.  Any  truncated  Fourier  series  repre-  ; 
sentation  of  this  curve  can  be  only  an  approximate  description.  Since  the  *L.^ 
descriptor  vector  used  to  characterize  and  classify  a  given  contour  consists 
of  the  coefficients  of  such  a  truncated  Fourier  series,  it  is  of  interest 
to  know  how  the  number  of  harmonics  present  affects  this  description. 

A  computer  program  was  written!  to,  reconstruct  a  contour,  given  the  . 
descriptor  vector  with  a  specified  nuiHber  of  components.  Basically,  the 
computer  is  programmed  to  construct  a  cdntrur  of  the  same  fprm  as  the  approxi 
mate  contours  discussed  previously; j that  is,  [one  consisting  of  straight-line 
segments  connecting  the  midpoints  of  adjacent!  grid  squares.  The  coarseness 
of  the  grid  may  be  adjusted  by  specifying  the\length  to  be  assumed  for  the  : 
contour  in  terms  of  grid  squares.  The  program  proceeds  by  evaluating  the 
Fourier  series  at  the  approximate  midpoint  of  the  next  segment  to  be  con¬ 
sidered.  (The  exact  midpoint  is  unknown,  a  priori,  because  the  segment 
length  may  be  either  1  or  \JTT  units.)  The  valuik  of  the  angle  computed  at 
this  point  is  rounded  to  the  nearest  of  the  eight  possible  values,  and  the 
contour  is  extended  by  a  line  increment  at  that  angle.  An  effort  is  made  to 
prevent  roundoff  error  from4 accumulating,  in  that  a  correction  term  equal  t,  : 
to  the  roundoff  error  at  the  previous  point  is  added  to  th^  angle  computed 
at  a  given  point. 

It  is  clear  that  this  reconstruction,  technique  yields  only  an  approxi-  | 
mation  to  the  contour  actually  specified  by  the  Fourier  coefficient 
descriptors^^JThe  approximation  becomes  exact  in  the  limit  as  the  grid  be- 
comers~'lnlTnitesimally  fine.  The  accuracy  of  the  approximation  may  be  judged; 
Dy  comparing  two  reconstructions  of  the  same  figure  based  on  the  same  de¬ 
scriptors,  with  one  having  a  grid  twice  as  fine  as  the  other  (Fig.  9). 

Figures  10-12  show  several  contours  reconstructed  on  the  basis  of  various 
numbers  of  harmonics. 


IV.  Standardization  of  Phase 

It  has  been  shown  in  Sections  I  that  a  given  contour  can  be  reduced  to 
a  4>  ( jg )  curve  which  is  periodic  and  the  shape  of  which  is  independent  of  the 
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Fig.  9 


Two  reconstructions  of  the  same  figure  from  a  descriptor  vector 
of  30  components  (cosine  and  sine  coefficients  of  first  15 
harmonics).  The  reconstruction  shown  in  (a)  is  based  on  the 
calculation  of  the  slope  of  the  contour  at  roughly  twice  as  many 
points  as  the  reconstruction  in  (b). 
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starting  point  used  in  tracing  the  contour.  The  amplitude  and  the  relative 
phase  of  each  harmonic  are  thus  fixed.  Howcvfer,  the  absolute  phase  may  vary 
arbitrarily,  depending  upon  the  starting  point  selected  for  tracing  the 
contour.  A  descriptor  vector  consisting  of  Fourier  series  coefficients  may 
be  constructed,  no  matter  what  starting  point  is  chosen.  All  such 
descriptor  vectors  are  "correct"  and  equivalent  in  that  the  figure  which 
can  be  reconstructed  with  a  given  number  of  harmonics  is  the  same,  (it  is 
assumed  that  enough  harmonics  are  taken  for  the  contour  to  close;  obviously 
any  break  in  the  contour  will  occur  at  the  starting  point  selected.) 


If  the  descriptor  vectors  are  to  be  used i to  classify  figures,  some 
restraint  on  the  starting  point  must  be  impbsid.  Classification  techniques 
depend  upon  grouping  the  end-points  of  n-dimension4l  vectors  in  n-space. 
Objects  described  by  descriptor  vectors  kre  recogriized  as  being  similar  if 
their  vectors  fall  in  the  same  region  of  n-space.  if  the  starting  point 
for  tracing  the  contour  is  allowed  to  vary  arbitrarily  along  the  contour, 
a  single  given  contour  will  lead  to  a  set  of  descriptor  vectors  specifying 
points  in  n-space  the  locus  of  which  irill  be  a  rather  complex  curve.  Its 
projection  on  any  one  axis  will  range  between  the  positive  and  negative 
value  of  the  amplitude  of  the  associated  harmonic,  it  is  therefore  clear 
that  uncontrolled  variation  of  the  s tar ting\ point  used  for  tracing  contours 
makes  meaningful  comparison .of  descriptor  vectors  quite  difficult. 

Fortunately,  any  descriptor  vector  obtained  upon  starting  to  trace  a 
contour  at  an  arbitrary  point  can  be  transformed  to  any  of  the  other 
possible  descriptor  vectors  by  a  completely  determinate  linear  transfor¬ 
mation.  A  shift  in  starting  point  amounts  to  merely  a  shift  in  phase  of 
the  periodic  $(4)  curve.  Consider  a  given  2n  dimensional  descriptor  Vector, 
consisting-of-H^pairs  of  cosine  coefficients,  A^,  and  sine  coefficients,  B^: 


V  ■  Al,  Bi,  — ,  Aj.,  Bi,  — ,  An,  Bn 


Ci  -  VV  +  Bi2 


and  conversely 
<l-5a>.  -  ^ 


(15b) 


,  Bi 

«  tarr1  ; 


A^ ■ “  cos 


B^  »  sin 
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For  any  two  vectors,  Va  and  Vb,  describing  the  same  contour, 


a  b 

Ci  =  ct  , 


since  the  harmonics  are  of  the  same  magnitude.  -However,  in  general 


a  b  y 

a  +  6 j_,  where  9]  f  0. 


Similarily,for  another  harmonic 


a  b 
Cj  aCJ 


a  b 

*j  *  +  sj  • 


and,  because  of  the  geometrical  interpretation  ofthe  quantities, 


i  0i 


By  Eq.  (20 )  the  phase  shift  'of  every  harmonic  can  be  computed  if  it  is 
known  for  any  one.  Tnus,  an  apparent  change  in  the  starting  point  can  be 
achieved  by  operations  upon  the  descriptor  vector. 

The  standard  starting  point  to  be  used  in  developing  vectors  for  the 
classification  of  contours  must  be  specified  in  terms  of  properties  of  the 
vectors,  since  no  salient  features  of  the  contours  themselves  are  known  a 
priori.  For  obvious  reasons  this  starting  point  must  be  unique.  It  must  be 
specified  in  terms  of  the  phase  angles  of  the  harmonics.  As  a  practical  ,  | 
matter,  the  contour  may  be  traced  starting  at  any  convenient  point,  and  a  ■  ;F 
virtual  shift  of  the  starting  point  may  be  brought  about  by  means  of  trans¬ 
formations  derived  from  the  equations  given  above. 

1  !i  !  .  '  ;•  : ;  L;1 

:i  ■  •• 

A  starting  point  may  be  uniquely  specified  by  the  requirement  that 
either  the  phase  angle  of  the  fundamental  or  some  function  of  the  phase 
angles  of  several  harmonics  assume  a  specified  value,  which  may  be  zero. 
Specification  of  a  starting  point  in  te;rm3  <*5f  the  phase  angle  of  any  one 
harmonic,  other  than  the  fundamental,  doe  is  not  lead  to  a  unique  result,  since 
any  given  phase  angle  occurs  at  several;  points  abound  the  contour. 


Specification  in  terms  of  the  fundamental  harmonic  alone  is  unsatis¬ 
factory  because  the  amplitude  of  that  harmonic  may  be  zero  for  certain  figures 
(e.g,,  any  regular  polygon)  and  the  angle  may  be  indeterminate.  Two  different 
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were  that  the  starting  point  be  seleeLd  Eoh  thafe'her  U°M 
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flmrnJlea?h  cafe  efCh  phase  angle  was  weiehted  by  the  corresponding 
amplitude  in  order  to  eliminate  the  difficulties  that  would  arise  if  a  civen 
harmonic  were  identically  zero,  with  indeterminate  phal^or^ such  smSl 

phase^ver^a^i derange?1"  pertUrtatl°is  |6<j  *■»■«  ««ld  ehange  the 

as  coodeisrthSiHLf°?’  accepti"6  a  Siven  way  of  (specifying  the  starting  point 
as  good  is  that  descriptor  vectors  of  similar  figure  be  consistently  closely 
grouped.  To  test  the  two  bases  for  selecting  starting  points  given  by  Eqs/7 

vectors  compa^ed?Veral  ^  °f  COn,tOUrs  Were  generated  and  their  descriptor 

a  different  objects  (models  of  an  A3J  aiicraft,  a  B47  bomber,  and 

a  T34  tank)  were  photographed  from  a  widb  range  or  azimuth  and  elevation 
angles.  Their  silhouettes  were  traced  and  the  first  six  harmonics  of  the 
Fourier  series  expansion  of  their  tangent-angle  vs.  arc-, length  description 
were  computed.  Transformation?  were  performed  upon  the  resulting  descriptor 
vectors  to  obtain  both  a  set  of  vectors  satisfying  the  requirements  of  Eq. 

(21)  and  another  set  satisfying  Eq.  (22).  * 

To  ob tainsome vindication  of  how  the  end-points  of  these  vectors 
clusteredr-trr-the  descriptor  space,  projections  of  that  12-space  upon  a 
set  of  mutually  perpendicular  planes  were  plotted.  In  particular,  out  of 
the  Ob  possible  planes,  those  six  were  chosen  Which  resulted  in  the  sine 
coefficient  of  each  harmonic  being  plotted  against  the  cosine  coefficient. 
iz  was  found  that  the  vectors  satisfying 


appeared  very  closely  grouped  in  the  projections  upon  the  Ak-Bk  planes  for 
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small  k,  but  were  quite  widely  scattered  for  larger  k  (Figs.  13  and  14). 
For  this  reason  the  criterion  of  Eq.  (22)  may  be  considered  inferior  to 
that  of  Eq.  (2l).  1 


The  clustering  resulting  from  imposition  of  the  requirement 


£ci»i  - 


is  shown  in  FigjL^-lJ-SO. 


It  should  be  noted  that  the  clusters  for  bjie  A3J  and  the  B47  are  both 
disjointed  from  that  for  the  T34  in  the  A±  -  Biplane,  and  the  clusters  for 
the  A3J  and  the  B47  are  disjointed  in  the  A3  -  B3  plane.  Thus  an  unknown, 
object  from  one  of  these  clusters  could  be  tentatively  identified  from  the 
projection  of  its  descriptor  vector  upon  these  two  planes.  It  should  also 
be  noted  that  this  disjointedness  is  not  evident  from  the  graph  showing 
the  range  of  individual  components  of  the  descriptor  vectors  for  the  various 
targets  (Fig.  21 ). 


V,  Pattern  Recognition 

The  term  descriptor  will  be  used  to  denote  a  number  whose  value  is  a 
measure  nf  t.fip-^p-grpp  to  which  the  figure  under  consideration  possesses  a 
.jcertain  property.  This  number  may  be  a  binary  one  or  zero,  denoting  the  presence 
or  absence  of  some  specific  feature  and  thus  giving  an  essentially  qualitative  • . 
description  of  the  figure;  or  the  number  may  assume  a  value  proportional  to 
some  characteristic  of  the  figure  which  may  be  specified  quantitatively. 

No  restriction  is  placed  on  the  properties  that  may\>e  characterized  by 
descriptors.  They  may  be  features  that  can  b^  recognized  by  inspection 
(e.g.,  whether  the  figure  falls  into  a  certain  region  of  a  superimposed  grid),  ■ 
by  simple  measurement  (e,g.,  height,  width),  dr  by  arbitrarily  complex 
computation  based  on  any  number  of  physical  measurements  (e.g.,  height- to- 
width  ratio).  Because  of  the  variety  of  different  quantities  that  the  term 
must  designate,  the  work  "descriptor"  appears  more  suitable  then  "measure¬ 
ment,"  which  has  also  been  used  in  this  sense.7  When  a  set  of  several 
descriptors  pertaining  to  a  given  figure  is  to  be  used,  it  is  convenient / 
arrange  them  in  a  fixed  order.  Such  an  ordered  set  of  descriptors  will  b 
called  a  descriptor  vector.  j  !  j'Lt 

1  ’  / !  i 

Regardless  of  how  the  descriptor  vectors  are  obtained,  they  are  useful  ;r:  L 
for  chrracter  or  pattern  recognition  if,  and  only  if,  the  following  two  re-  jLi; 

quirements  are  satisfied:  (l)  It  is  easier  to  perform  comparisons  of  the  ■  : 

descriptor  vectors  than  of  the  figures  themselVes,  and  (2)  the  descriptor  1 

vectors  of  similar  figures  appear  similar  under  the  comparisons  performed. 


of  the  cosine  and  sine  coefficients  of  the  fifth  harmonic. 
Descriptor  vectors  have  been  adjusted  to  satisfy  Equation  (22) 
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Fig.  15  -  Projection  of  descriptor  vectors  of  three  objects  viewed  from 
various  orientations  upon  the  plane  of  the  cosine  and  sine 
coefficients  of  the  first  harmonic.  Descriptor  vectors  have 
been  adjusted  to  satisfy  Eq.,  (21). '  Note  that  cluster  of  T34 
tank  is  disjointed  from  clusters  of  both  aircraft.  | 
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Fig.  16  -  Projection  of  descriptor  vectors  ofuhree  objects  upon  the  plane 
of  cosine  and  sine  coefficients  of  the  second  harmonic.l 
Descriptor  vectors  satisfy  Eqj  (21).  1 
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Fig.  17  -  Projection  of  descriptor  vectors  of  three  objects  upon  the  plane 
of  cosine  and  sine  coefficients  of  the  third  harmonic.  Descriptor 
vectors  satisfy  Eq.  (21).  Note  that  clusters  for  the  two  aircraft 
are  disjointed.  j. 
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T  3  4  * 

Fig.  18  -  Projection  of  descriptor  vectors  of  three  objects  upon  the  plane 
of  cbsine  and  sine  coefficients  of  the  fourth  harmonic. 
Descriptor  vectors  satisfy  Eq.  (21) .  ~ 


Fig.  19  -  Projection  of  descriptor  vectors  ofj  three  objects  upon  the  plane 
of  the  cosine  and  sine  coefficients  of  the  fifth  harmonic. 
Descriptor  vectors  satisfy  Eq.  (21).  The  appearance  of  two  dis¬ 
tinct  clusters  for  the  T34  tank  is  attributed  to  the  significant 
difference  between  those  views  Jin  which  the  tank's  gun  is  visible 
and  those  in  which  it  is  not. 
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The  present  report  deals  with  descriptor  vectors  the  components  of  ,  A 
which  are  the  cosine  and  sine  coefficients  of  the  Fourier  series  expansion, 
of  the  $(i)  curve  of  a  given  figure.  It  is  clear  that  is  is  easier  to 
formulate  a  measure  of  similarity  for  the  descriptor  vectors  than  it  is  to 
do  this  for  the  contours  which  these  descriptors  represent.  It  is  also 
clear  that  continuous  deformation  of  a  contour  will  lead  to  continuous 
change  of  the  4>(i)  curve  and  continuous  variation  of  the  Fourier  coefficients. 
Thus  the  similarity  of  two  figures  should  manifest  itself  in  the  components 
of  the  vectors  being  approximately  equal. 

I  ■■  ■  •  -  ::  • 

There  are  several  standard  techniques  for  classifying  patterns  on  the 
basis  of  n-dimensional  descriptor  vectors  of  some  type.  In  one,  the  dis¬ 
tance  is  measured  in  n- space  from  the  erid-point  of  the  descriptor  vector 
derived  from  a  given  unidentified  figure  to  the  end-points  of  descriptor 
vectors  derived  from  a  set  of  standard  figures.  The  unknown  is  then 
identified  as  the  character  associated  with  that  vector  to  which  this  dis¬ 
tance  is  smallest.  Various  transformations  and  deformations,  of  the 
n-dimensional  vector  space  may  be  carried  out  to  improve  the  likelihood  of 
correct  identification  if  the  statistical  properties  of  the  descriptors  are 
known. 8 

.  \  \  \ 

Another  standard  technique  involves  subdividing  the  \n-dimensional 
descriptor  space  into  cells  or  regions.  The  components  of\ thq  descriptor 
vector  of  an  unidentified  figure  are  considered  aB  the  coordinates  of  a 
point.  The  unknown  character  is  then  identified  as  belonging  to  that  class 
in  whose  region  in  n-space  this  point  falls.8 

In  general,  no  direct  correspondence  can  be  established  between  the 
measjirg_jD£-^imilarity  1°  terms  of  the  descriptor  vectors  and  the  similarity 
of  various  features  of  the  character  themselves.  For  the  descriptors  under 
consideration  here,  measures  of  similarity  ^o  exist  which  have  definite 
physical  interpretations  both  in  terms  of  the  vectors  in  n-space  and  in 
terms  of  the  $(i)  contour  representation.  Consider  pairs  of  vectors  in 
n-space j  a  plausible  measure  of  similarity  is  the  angle  9  between  them.  The 
smaller  this  angle,  the  more  nearly  parallel  and,  in  this  sense ,Jpore  .nearly 
similar  are  the  vectors.  Now  the  angle  9^q  between  two  vectors  A  and  5  is 
given  by 


A  •  B 

°°s  wur 


COS  = 


(X 


IAiI 


Bi2)  * 
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If  the  components  of  the  vectors  A^and  B  are  the  Fourier  coefficients 
of  *(£)  functions  derived  from  two  contours,  it  can  be  shown  that, cos  0^, 
the  measure  of  similarity  in  the  descriptor  n-space,  is  the  approximate 
normaitzecr'cross- correlation  P$a<t>b(o)  of  the  functions  <t>a(-0  and  <t>b(-0* 

By  definition,  the  cross-correlation  \ 


(25) 

For 

and  *b  periodic  w 

to 

(26) 

fy**b^  **  x 

Now 

”  09 

(27a) 

♦»<*>  -  A 

and 

00 

(27b) 

♦b(i)  a  A  : 

J-l 

since  the 

<t>(j!)  functions  have 

^T/2 
■  T/2  4 


cos  +  B2j  sin  j£  , 


Substituting  the  expressions  for  <t>a  and  of  Eqs.  (27a  and  b)  into  Eq, 
(26),  and  taking  into  account  that  T  *  2JT,  one  may  write 


1  rnr  r1  r* 

(28)  %*b(0)  -  gj  /  \  L  L  (A2i-1  cos  i£  +  A2i  sin 

J  .  1  J _ 1 


1  r®r 

i=l  j=l 

x  (Bsj.^  cos  +  B2j  sin  &£  . 


ii) 


Carrying  out  the  indicated  multiplication  and  interchanging  the  order 
of  summation  and  integration,  one  obtains 
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(29) 


00  00  . 

s  }_j  }_,  (A2i-iB2 j-i )  cos  it  cos  ji  d i 

i=l  j=l  j  ! 

i  1  rn 

+  (AefBaj)  sin  it  sin  ji  di 

1  PR 

+  (Aei.iBaj)  ^ cos  it  sin  ji  di 

1  Pn 

+  (A2iB2j_1)  sin  ii  cos  ji  d t. 


Taking  into  account  that 


(30a) 

(30b) 

(30c) 

one  then  may  obtain 


1  rn 

—r  /  cos  it  cos  At  d t  *»  6j  , 

nJ-n  I  j  j  1J 

if11  I  . ;  j  I 

sin  it  siri  ji  di  a  Lj  , 

1  rn 

—  /  cos  ii  sin  j i  di  »  0, 

n  J-n  1 


and 


(31)  ^a^b^0^3  2  ^-1*  X  ^i-i^d-i^ij-  +  (A2iB2i)5ij  > 

i«l  0*1  .... 

which jmyJbe--rewritten  as 

00 

(32)  R*aR*b(0)=|X  AiBiV 


i»l 


The  normalized.  cross-correlation  p  is  defined  as 


p4»a*b^ 


H,  „  (l) 

’ab 
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which  has  the  property  that 

4 

with 


p*a*b 


(0)  «  1; 


\ 


♦*0O  -  C  *b(i), 


if  ♦a  and  $b  are  periodic,  piecewise  continuous  functions.10 

Novj  while  the  Fourier  expansion  of  a  <t>(,e)  function  in  general  leads  to 
an  infinite  series,  it  has  been  shown  already  (Section  III)  that  for  practical 
purposes  the  function  may  be  adequately  represented  by  a  truncated  Fourier  i ;!/ 
series  of  n/2  harmonics,  where  n  is  the  number  of  components  in  the  corres¬ 
ponding  descriptor  vector.  ^ 

i:-'.  i 

Combining  Eqs.  (32)  and  (33) >  and  taking  into  account  that  Aj  and  Bj 
are  zero  f-or-ir  >  n,  one  obtains  ^  ' 


n 


i 


(37) 


L  AlBi 

i=l 


n  •>  ,n 

(McH  m 

i-l  i-1 

I  •  '  '  v.  : 

The  expression  for  p*  4  in  Eq.  (37)  in  precisely  that  of  Eq.  (24)  for 


c°s  Therefore 


(38) 


P*a*b(0)  "  C0S  ®AB* 


In  view  of  this  correspondence,  the  two  previously  mentioned  bases  for 
classifying  figures  maybe  evaluated.  Consider  first  the  technique  of 
classifying  figures  according  to  the  distances  between  the  end  points  of 
their  descriptor  vectors  and  the  end  points  of  the  set  of  standard  vectors. 
With  the  loss  of  some  information,  all  the  vectors  may  be  reduced  to  the 
common  length  unity  by  the  transformation 


3*+ 


(39) 


y 


BEST  AVAILABLE  COPY 


which  implies  that 


f  a| 


(bo) 


H 


[  M 


i=l 


From  simple  geometrical  consideration  it  is  clear  that  the  measure  of 
similarity  for  this  case,  -  ..  L.:.;  - 


(41) 


n  i 

-  £(V Bi'f  > 


i=l 


is  equivalent  to  the  measure  cos  0^  in  the  sense  that  both  will  give  the 
same  ordering.  '  ^  • 

It  is  intuitively  clear,  therefore,  that  the  measure 


(42) 


n 

V 


i»l 


is  better,  since  it  is  based  on  a  more  complete  description  of  a  given  char' 


acter. 


.1 


I  : 

It  is,  in  fact,  the  mean  square  difference  of\the  4 (£) ,  functions  of  the; 
figures  associated  with  A  and  B.  This  may  be  shown  by  the  flowing:  Con¬ 
sider  the  functions  *aC0  and  **,(*)  with  zero  mean  and  perio^2II.  Their  mean 

square  difference  is  L 


(43) 


ems  -  Si//'*!')  ' 


As  before,  one  may  adequately  represent  4&(t)  and  4>b(i)  by  their  truncated 
Fourier  series  of  n/2  harmonics.  Then  \ 
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i  rnr  v  i 

EMS  =  2n J  /^ai-i  cos  ^  +  A2i'  sin  iZ  -  B^..^  cos  ii  -  B2i  sin 

"n  H  I  ;  tl  !  -* 


cos  IZ  +  (A2i~B^i)  sin  iij 


«V  <_  -  V 

-  [£«*  COS  j.0  +  (A2j-B2j)  sih  jijc 

J»1 

n/2  n/2 

^  7  (^ei-x*B2£.i  )(A2j-1-B2(j-i )  cos  ^  cos 


i  rn  • 

+  (A2i-i-B2j[.i  )(A2j-B2j)  2H  cos  ^  sin 

i  rn 

+  (A2^-B2i)(A2j_,x-B2j_1)  21 xj  sin  ^  cos 

-II 

1  rn 

+  (A2l-B2i)(A2j-B2j)  sin  iZ  sin  jZ  d Z. 

faking  into  account  Eqs.  (30a,  b,  c)  one  may  reduce  Eq..  (44)  to 

I 

n/2  n/2  \ 


EMSa  2^_,  ^  (Aai-l-Bai-i)(Asj_i-B2j-i)S^  +  (B2i-B2i)(A2.j-B2j)6i 
i=l  J-l  !  ■  i 
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Comparing  the  expression  for  EMS  of  Eq.  (45)  with  that  for  D2  in  Eq. 
(32),  one  obtains  the  equivalence  of  the  criteria 


(46) 


i  D2 


E, 


'MS* 


In  view  of  Eq.  (46),  the  techniques  of  improving  recognition  by  dis-  : 
torting  the  vector  space,  to  expand  some  components  and  contract  others,  can 
be  reinterpreted  as  faltering  of  the  <t»(i)  functions. 


-  -  The  second  standard  technique  of  character  recognition  which  has  been  if 

mentioned  above  involves  classifying  figures  according  to  the  regions  of 
n-space  occupied  by  the  end-points  of  their  descriptor  vectors.  The  regions 
may  be  defined  by  hyperplanes  separating  each  paiXof  different  figures  in 
the  set  of  distinct  figures  expected.  The  classification  operation  cijauld 
be  performed  upon  a  given  <t>(i)  function  by  a,  device  consisting  of  phase-  1 
sensitive  detectors  to  generate  the  descriptor  vectors  and  a  resistance-diode 
matrix  to  effect  the  separation  by  hyperplanks.  it  can  be  shown  that  the  1 
classification  scheme  based  upon  this  principle  will  lead  to  greater  likeli¬ 


hood  of  correct  identification  than  any  scheme  based  upon  the  measure  D* 


even  if  the  descriptor  space  is  linearly  distorted  in  such  a  manner  as  to 
improve  the  grouping  of  classes.9 


So  far,  relations  have  been  shown  between  the  measures  of  similarity  in 
vector  space  and  measures  of  similarity  between  different  <t>(-0)  curves.  It  ; 
remains  to  be  demonstrated  that  these  have  relevance  to  the  recognition  pro¬ 
blem  in  the  sense  that  figures  that  appear  similar  to  a  human  being  lead  to 
41  (i)  functions  that  are  similar  according  to  criteria  that  can  be  formulated 
mathematically. 


The  evidence  available  indicates  that 
in  fact  reflected  in  the  similarity  of  the 
several  sets  of  experimental  evidence.  1 


the  similarity  of  the  figures  is 
4>(i)  curves;  this  is  indicated  by 


First,  a  series  of  comparisons  was  performed  on  the  descriptor  vectors 


were 


of  the  contours  of  Fig.  22.  The  measures  6f  Isimilarity  cos  6>AB  and  D‘ 
computed  for  each  pair  of  vectors.  Part  of  the  resulting  table  of  matches 
is  shown  in  Fig.  23,  where  the  figures  are  arranged  in  order  of  decreasing 
similarity.  It  can  be  seen  that  the  similarity  detected  in  the  vectors  does 
indeed  correspond  quite  well  to  the  similarity  that\a  person  looking  at  the 
figures  might  consider  reasonable.  It  should  be  noted  that  essentially  the 
same  ordering  of  best  matches  was  given  by  the  criterion  cos  and  the 
criterion  D2.  Furthermore,  it  was  found  that  adding  components  to  the 
descriptor  vectors  beyond  the  cosine  and  sine  term  coefficients  of  the  first 
three  harmonics  had  no  appreciable  effect  on  the  ordering. 


Second,  a  total  of  seven  aircraft  models  and  one  tank  modql  were  photo¬ 
graphed  from  various  elevation  and  azimuth  angles  and  12- dimensional  de¬ 
scriptor  vectors  were  computed  for  each  view.  (Besides  the  A3J  and  B47>  the 
aircraft  included  were  a  DC3,  F?4C,  B70,  B58,  and  Super  G  Constellation.) 
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Order  of  matching  and  coaine  of  angle 
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0.968  0.953  0.756  0.722 


0.954  0.766  0.662  0.655 


0.913  0.857  0.750  0.677 


0.768  0.767  0.742  0.722 


0.968  0.966  0.778  0.745 


0. 745  0. 722  0. 722  0. 666 


0.913 


0.887  0.682  0.662  0.657 


Fig.  23  - 


Order  of  matching  and  cosines  of  angles  between  descriptor 
vectors  for  selected  figuresV from  set  shown  in  Fig.  22. 
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By  looking  at  projections  of  the  descriptor  vectors  in  12- space  upon 
various  planes  (Figs,  14-19),  one  can  see  that  different  views  of  a  given 
object  tend  to  result  in  descriptor  vectors  which  fall  into  clusters,  and 
that  clusters  caused  by  different  objects  tend  to  be  disjointed. 


Each  components  of  the  descriptor  vectors  is  a  continuous  and  periodic 
function  of  the  azimuth  and  elevation  angles.  From  this  fact  one  can  deduce 
that  the  locus  of  the  descriptor  vector  end-points,  as  the  azimuth  and 
elevation  angles  range  through  all  possible  values,  is  a  closed  surface  on 
which  any  point  can  be  specified  by  two  parameters.  Thus  the  distribution 
of  the  descriptor  vectors  that,  is  to  be  expected  if  descriptors  are  derived 
for  various  views  of  one  given  object  varies  in  a  significant  way  from  the 
distribution  to  be  expected  if  descriptors  are  derived  for  a  number  of 
similar  (but  not  identical)  objects  looked  dt  in  the  same  way.  The  latter 
set  of  descriptor  vectors  could  be  expected  to  form  a  cluster,  with  points 
dense  near  its  center  and  sparsely  distributed  farther  from  the  center. 

This  difference  is  significant  for  practical  purposes  when  descriptors 
are  to  be  used  for  classifying  figures. |  A  common  way  of  identifying  an  un¬ 
known  vector  involves  measuring  its  disiancej  to  the  mean  vector  of  a  sample 
set  of  a  given  class.  The  unknown  is  then  identified  as  a  member  of  that 
class  if  the  distance  is  sufficiently  small. j  Thik  technique  is  reasonable 
only  if  one  expects  that  the  mean  vector  falls  in\a  region  where  the  sample 
descriptor  vectors  are  dense.  It  is  not  reasonable  if  the  sample  descriptor 
vectors  are  distributed  along  a  surface  that  may  ak  every  point  be  quite 
far  from  the  mean  vector.  j 

The  difference  between  the  two  cases  may  become  clearer  if  we  consider 
two  analogous  physical  situations  tha+  illustrate  the  difference.  In  each 
case,  we  shall  be  given  the  three  cartesian  coordinates  of  a  point  and  shall 
be  asked  if  the  point  f'alls  ihs.Me  either  of  a  pair  of  metal  Objects  fixed 
in  space,  which  we  shall  have  been  allowed  to  examine  and  measure  previously, 
but  which  will  no  longer  be  available  for  a  direct  test.  The  first  pair  of 
objects  will  be  rather  shapeless  lumps  of  solder.  Clearly,  we  will  be  able 
to  at  least  make  a  reasonable  guess  about  the  given  point  if  we  remember 
the  location  of  the  centers  of  gravity  of  the  lumps  and  very  roughly  their 
approximate  diameters..  The  second  pair  of  objects  will  be  contorted  loops 
of  thin  ^wire-i — nfls  clear  that  a  different  approach  is  needed;  the  centers 
of  gravity  of  the  wires  themselves  are  probably  not  in  the  wires,  and  it  may 
even  be  that  the  center  of  gravity  of  one  wire  lies  within  the  other. 

Some  impression  of  the  shape  of  distribution  of  the  descriptor  vectors 
may  be  gained  by  considering  Figs.  24a  and  b.  The  figure  shows  the  number 
of  descriptor  vectors  at  a  given  distance  from  their  means  as  a  function  of 
the  distance.  The  similarity  of  the  curves  to  Fig.  25c,  as  compared  to 
Fig.  25d,  shows  that  descriptor  vectors  do  in  fact  appear  to  be  along  surfaces 
away  from  their  mean. 

The  separation  between  groups  of  descriptors  derived  from  various  air¬ 
craft  viewed  from  different  angles  can  be  judged  by  considering  Fig.  26, 
which  shows  projections  of  the  end-points  of  vectors  on  a  plane  chosen  to 
show  the  maximum  separation  between  their  convex  hulls.  The  plane  chosen 
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for  projecting  any  three  groups  was  that  passing  through  their  centers.  It 
can  be  seen  that  more  refined  techniques  are  needed  for  distinguishing 
between  these  classes  thar.  simple  ideas  of  measuring  distances  to  one 
reference  point  for  each,  or  of  subdividing  the  space  by  hyperplanes. 

The  most  promising  approach  for  solving  this  problem  appears  to  be  to 
subdivide  a  class  into  subclasses,  each  of  which  can  be  effectively  charac¬ 
terized  by  its  midpoint.  Research  in  this  area  remains  to  be  done,  even 
though  techniques  that  separate  large  groups  into  suitable  subgroups  are 
known.11 


%  .  ..  „  ^ 


VI,  Alternate  Expansions  of  $(1)  Functions 

The  process  of  extracting  descriptors  for  a  figure  is  really  a  coiribin-, 
ation  of  two  related  but  different  processes.  The  first  consists  of  making  ; 
measurements  of  some  set  of  physical  attributes  of  |:he  figure.  The  second 
is  essentially  a  process  of  encoding  the  results  of  these  measurements  into 
a  set  of  numbers.  The  usefulness  <f>f  the  descriptors  obtained  ultimately 
depends  on  the  significance  of  the  attributes  that  are  measured.  The  coding 
process,  however,  is  highly  important,  for  it  may  have  a  great  influence 
upon  how  conveniently  the  resultant  descriptor  vectors  may  be  categorized. 

The  fundamental  attribute  of  a  figure  that  has  been  measured  throughout  the 
course  of  the  research  reported  here  is  the  angle  between  the  tangent  to 
its  contour  and  a  reference  line  as  a  function  of  distance  along  the  contour. 
The  computation  of  descriptors  by  expanding  the  $(.0)  function  in  Fourier 
series  is  only  one  of  many  possible  encoding  process. 

It  is  reasonable  to  try  to  express  the  $(i)  functions  as  a  series  of 
orthogonal  functions  and  to  consider  the  serids  coefficients  to  be  descriptors 
The  most  familiar  series  of  this  type  is  the  Fourier  series,  the  one  that 
has  been  used  so  far.  In  looking  for  other  sets  of  orthogonal  functions 
that  may  be  useful,  one  is  attracted  by  functions  which  assume  only  the 
values  ±  1.  Their  obvious  merit  is  their  simplicity.  It  is  very  easy  both 
to  expand  an  arbitrary  function  in  terms  of  l  it  [l  functions  analytically  and 
to  implement  circuitry  to  do  so  elcctronidallyf  (Fig; |27).  Nevertheless,  com¬ 
pletely  equivalent  results  can  be  obtained  even  morb  simply. 

It  will  be  shown  that  attempts  to  obtain  descriptor  vectors  by  expanding 
a  <t>  ( jg )  function  in  terms  of  several  sets  of  orthogonal  functions  whose  values 
are  limited  to  ±  1  can  yield  results  no  more  useful  than  those  of  a  much 
simpler  scheme  for  extracting  descriptors.  ^ 

There  are  several  ways  of  generating  functions  Waving  the  desired  pro¬ 
perties  .  The  most  instructive  begins  with  a)  consideration  of  the  properties 
of  Hadamard  matrices,  A  Hadamard -matrix  of  order  n  is  defined  as  an  n  x  n 
matrix  whose  elements  assume  only  the  values  ±  1  and  whose  rows  And  columns 
are  mutually  orthogonal.  A  Hadamard  matrix  of  order  2  is 


(47) 


h(2)  = 


1  1 
1  -1 
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27  -  Circuit  for  computing  a  coefficient  of  a  series  expansion  of  a 

function  f  in  terms  of  orthogonal 'functions  assuming  the  values 


It  can  readily  be  shown  that 
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(48) 


,  r  h  Cn )  H\(n) 

A  ■  L  H(»)  -  h\»)  J 


2n 

a  0  +  0  ■  0  for  i  /  j  and  i  4  j  ±  n 

r1 

ZA*1  ak,j 

*»  n  +  n  a  2n  for  i  =  j 

=  n  -  n  *  0  for  i  »  j  ±  n 

is  a  Hadamard  matrix  of  order  2 n.  Obviously  all  elements  of  A  have  values 
±  1.  Furthermore,  the  inner  products  of  each  pair  of  columns  of  A  can  be 
considered  as  the  sum  of  two  inner  products  of  the  corresponding  columns  of 
the  n  x  n  Hadamard  matrices.  Thus  it  is  clear  that 


(49)  _ 

. _ =  n  -  n  » 

■  i  ■  ■■  m 

and  the  orthogonality  relations  for  the  column^  are  satisfied.  A  similar 
proof  holds  for  the  rows;  thus  all  requirements  for  A  to  be  Hadamard  are 
satisfied.  Equation  (49)  allows  Hadamard  matrices"  of  order  2®  to  b$  gen¬ 
erated  quite  readily.  |  ; 

Orthogonal  functions  suitable  for  expanding  ;a  function  over  the  interval 
[0,L]  derived  from  Hadamard  matrices  of  order  n  are  the  sets 

i  »  1,2,  —  n,  r'Xf, 

where  a  particular  function  R^^Cx)  assumes  the  values  of  appropriate 
elements  of  the  H'n'  matrix  on  sections  of  [the  interval  [0,L).  Specifically 

(50) 

*  ~ 

As  an  example,  consider  the  set  iRi^r* 
is  1  J 


L-I'Li  : 


A  Hadamard  matrix  of  order  8 


I  i 
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1  -1 


1  -1 


1  1 


1  -1 


-1  -1 


-1  1 


-1  -1 


-1  1 


1  1 


1  -1 


The  corresponding  set  of  functions]  Rj_^8^(x)  is  shown  in  Fig.  28. 


It  should  be  noted  that  there  is  np  linlque  Hadamard  matrix  of  order  8. 
The  one  considered  above  was  obtained  'b'y  starting  with  the  H^2)  given  by 
Eq.  (47)  and  repeatedly  applying  Eq.  (48).  However,  any  matrix  obtained 
from  that  of  Eq.  (51)  by  any  arbitrary  reordering\ of  rows  and/or  columns 
would  still  be  a  Hadamard  matrix.  Such  reordering  would,  of  course,  change 
the  order  and/or  form  of  the  derived  functions,  1^. 

A  given  integrable  function  f(x)  may  be  expanded  in  terms  of 
to  give  a  set  of  n  coefficients  where  \ 

.1  ]  '  X  \ 

(52)  C/n)  -  iJ  Lf(x)R  .(n)(x)dx.  ' 


In  view  of  Eq.  (50),  Eq.  (52)  may  be  rewritten  as 

°jtn)  -E 


J  n  f (x)dx  -  ^  ^(x), 


where  fj_(x)  is  the  average  value  of  f(x)  over  the  ith  interval. 
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Thcre^pcg 


n 


(55) 


or 


(56) 


L=1 


n 


c/n,4 1*1  jn)  ^  V(x> 


cj(n>4  I%,jCn)fiW  • 


i=l 


over 


It  is  clear  that  all  functions  f(x)  which  do  not  differ  in  average  value 

each  of  the  set  of  n  subintervals  L,  A  L  ,  i  «*  l-,2  — -  nj-,  will' 

yield  the  same  set  of  coefficients  (C^(n)).  Thus,  for  the  purpose  of  finding 
the  expansion  in  terms  of  the  functions  R^(n),  a  function  f(x)  is  equivalent 
to  the  piecewise  constant  function  F(x),  with  F(x)  defined  by 


I 


(57) 


f(x)  =  ^(x), 


x  <  i  L 
n  [  ^  n 


-k  A  given  function  F(x)  can  be  described  completely  by  specifying  a  vector 
V  of  n  components  such  that 


(58) 


*  F(x)  on 


i=iL,±L 

n  *  n 
■  \ 


or 


(59)  V1  »  f i (x) • 

Then  Eq.  jls6-)-may  be  rewritten  as 


(60) 


C  (^)  ss  “  )  u  (n)  y 

n  ^Hi,r  V 


i»l 


k9 


If  c 


.(») 


is  considered  as  the  jth'of  the  n  components  of  the  descriptor 


vector  C^n\  Eq.  (60)  is  seen  to  imply  ;  the  vector  equation 


(61) 


,(n)  1 


n 


\H' 


n>)TV. 


Since 


lH(n) 


n 


is  an  orthogonal  matrix,  Eq.  (61)  shows  that  the  descriptor 


vector  C<n> 
vectors 


is  the  vector  V  rotated!  by  some  fixed  ^amount.13  The  descriptor 
of  all  figures  can  be  obtained  from  t|ie  corresponding  vectors 

V  by  the  same  rotation."  That  means  that  the  vector's  derived  from  a 

set  of  different  figures  will  b,e_ar  exactly  the  same  relationship  to  each 
other  in  n-space  as  the  vectors  V  of  the  same  figures;  i.e.,  \tjhe  problem  of 
classifying  figures  on  the  basis  of  their  descriptor  vectors  is  exactly  the 
same  whether  the  vectors  given  are  of  the  type  Cvn2  or  of  the  type  V. 


The  orthogonal  functions  which  assume' only  values  ±  1  that  have  been 
considered  so  far  have  been  derived  from  Hadamard  matrices.'  Alternate  ways 
of  generating_^uch  functions  exist. 

Consider  first  the  system  of  Rademacher  functions  l  ^(x))*  defined 
on  the  interval  [0,1]  by  \ 


(62) 

where 

4k(x)  »  sign[sin(2kIDc)] 
(k  »  0,1,2* •  • ) , 

+1  for  x  >  0 

(63) 

sign  x  «  0  for  x  a  0 

*  v  * ,  -- 

_  —  — -  -1  for  x  <  0 

'  ’i  . 

These 

definitions  lead  to 

„  \  n 

0 

1 

forx  =  — 

2k 

(64) 

Vx)  s  1  I 

Ii  n  ^  n+1 

(-l)n 

for  gE  <  x  < 

(n  a 

0,1 ,  • *  *  2^  -  l) 

The  set  of  Rademacher  functions,  though  orthogonal,  is  not  complete, 
and  therefore  not  useful  for  expanding  arbitrary  functions » 
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A  complete  set  of  orthogonal  functions  that  may  be  derived  from  the 
Rademacher  functions  is  the  set  of  Walsh  functions  (¥k),  defined  on  the 
interval  [0,1]  by 


(65a) 


w  - 


(65c) 


Vx)  '  %+lW  4nr+l{x)’ 


n  a  ^ll+1  +  2n2+1  +  —  +  2nr+1 


with  n2  >  n2  >  •  •  •  >  nr,  where  the  4j(x)  are  Rademacher  functions.  The  first 
eight  Walsh  functions  are  shown  in  Fig.  29.  Comparison  with  the  functions 
( Hi  ’  / }  derived  from  the  Hadamard  matrix  of  order  8  given  in  Eq.  (51)  shows  lij  ■ 
that  the  two  sets  of  functions  are  identical,  though  differently 
ordered:  .(*  -  Rx(0);  ^  =  R5(8'}  »  R3(a5;  ¥3  »  R7t  )}  ¥4  »  R2(a); 

T5  -  V0);  ?6  =  R.(a)j  V7  =  R8(8)).  I  :  1  ;:  v 


It  is  trivial  to  show  that  all  Walsh  functions  can  be  derived  from 
appropriate  Hadamard  matrices.  Consider  jthe  Walsh  function  ¥n,  with  2m”1  <  i;|: 
n  <  2m,  m  >  1.  It  follows  from  the  definition  (Eq.  (65))  that  the  highest*1:  - 
order  Rademacher  function  occurring  in  thW  product  that  forms  Yn  is  Thire 

are,  therefore,  exactly  2™  subintervals  of  [0,1]  over  which  may  assume  a  .;V} v' 
distinct  constant  value;  ¥n  may  thus  be  described  completely  by  giving  itsjSjL; 
value  on  each  such  subinterval.  Let  Af(n)  be  the  value  of  Yn  on  the  ith 
such  interval,  when  1  <  i  <  2m.  Bjere  are  a  total  of  2m  Walsh  functions 
which  can  be  described  by  vectors  A\w  of*  2m  components.  Since  the  Walsh  . 
functions  are  orthogonal  on  [0,1],  W, 


J  ¥n(x)  fk^x)  dx  »  ol  n  4  k 


2m  2m 


^  J  Aj_(n)A^OOcbc  *  y\Aj;  (n)A^(tc)  *  0. 


1*1  i-i 

2m 


This  is  equivalent  to 


0,  n  /  k. 
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If  we  arrange  the  2m  vectors  a(J)  to-be  the  columns  of  a  matrix,  it 
will  be  Hadamard,  and  the  orthogonal  functions  (Rj_(2m)}  derived  J^rom  it 

will_o£_co«x&e'Jbe  the  Walsh  functions.  It  follows  that  expansion  in  W^lsh 
functions  is  no  more  than  a  rotation  of  the  simpler  descriptor  vector  V. 

The  n-component  vector  V  was  defined  by  kq.  (59)  to  be  merely  a  con¬ 
venient  notation  for  describing  a  piecewise  constant  approximation  to  a 
given  $(i)  function,  with  each  of  its  n  components  assuming  the  value  of 
the  average  of  the  function  over  one  nth  of  the  total  interval.  It  appears 
that  the  number  of  '.ubintervals  n  must  be  quite  large  for  the  approximation 
to  be  satisfactory;  i.e.,  the  vector  V  must  have  many  components.  It  is 
desirable  that  a  descriptor  vector  have  the  minimum  number  of  components 
consistent  with  providing  an  a  adequate  characterization  of  the  figure. 
Figure  30  shows  the  results  of  attempts  to  reconstruct  a  crude  silhouette 
of  a  B-70  aircraft  from  a  descriptor  vector  of  twelve  components.  These 
are  the  cosine  and  sine  coefficients  of  the  Fourier  series  expansion  of  the 
4>(i)  function  and  from  two  12-component  vectors  of  the  type  V  derived  from 
the  same  $(,£)  function  but  assuming  different  starting  points.  Comparison 
of  Fig. Figs.  30c  and  d  shows  that  on  the  whole  the  Fourier  series 
coefficients  axe  roughly  equivalent  as  a  set  of  descriptors  to  the  sets  of 
average  values  over  different  subinterval.  ^ 

However,  it  is  apparent  from  comparison  of'Vig.  30c  with  Fig»30d  that 
the  vectors  V  vary  significantly  with  the  startirig  point  on  the  contour. 

It  was  found  that  the  magnitude  of  the  vector  difference  between  the  vectors 
giving  rise  to  Figs.  30c  and  d  was  roughly  one- third  as  large  as  the  mag¬ 
nitude  of  the  vectors  themselves.  This  is  ja  very  large  difference  to  result 
from  displacing  the  starting  point  for  tracing  the  contour  only  one-twenty- 
fourth  of  the  total  length  of  the  contours  (half  the  length  of  one  sub¬ 
interval).  Descriptor  vectors  consisting  of  the  coefficients  of  the  Fourier 
series  expansion  of  the  <t>(i!)  function  also  change  with  a  change  in  starting 
point,  but  it  is  possible  to  perform  transformations  to  shift  the  virtual 
starting  point  at  will.  This  is  not  possible  with  descriptor  vectors  of  the 
type  V. 

Thus  it  appears  that  the  descriptor  vectors  considered  throughout  most 
of  this  report,  the  coefficients  of  the  Foufier  series  expansion  of  the 
<t>(X)  function,  are,  from  considerations  of  conciseness,  as  good  an  encoding 
of  the  information  contained  in  the  ${l)  function  as  the  alternatives  so  far 
investigated.  Furthermore,  they  have  properties  that  make  it  easy  to  change 
the  virtual  origin  for  the  4>(i)  function,  and  to  define  a  standard  origin 
for  all  figures  (see  Section  IV).  However,  expansion  in  Fourier  series, 
though  by  rio  means  prohibitively  complicated,  is  a  process  sufficiently 
involved  to  make  it  appear  worth  while  ■jxj  investigate  further  what  other 
descriptors  might  be  derived  by  alternate  encoding  of  the  $(£)  functions. 

The  $(i)  function  itself  appears  to  be  a  fundamental  and  useful  description 
of  shape.  ’  1  ■  1  ( 
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30  -  An  approximate  contour  representation  of  a  B70  aircraft  (a)  and 
reconstructions  of  the  contour  from  a  descriptor  vector  of  twelve 
components. .  Reconstruction  (b)  is  based  on  the  Fourier  coeffi¬ 
cients  of  the  first  6  harmonics;  (c)  and  (d)  are  based  on  des¬ 
criptor  vectors  of  the  type  V,  defined  by  Eq.  5 9>  with  different 
starting  points  on  the  contour  used  for  calculating  the  descriptor 
_  j  ettcFs . 
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Alternates  to  the  Fourier  series  expansion  of  the-  slope  function  are  also  con¬ 
sidered.  i 
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